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Abstract 

Gene assembly in ciliates is an extremely involved DNA transforma- 
tion process, which transforms a nucleus, the micronucleus, to another 
functionally different nucleus, the macronucleus. In this paper we char- 
acterize which loop recombination operations (one of the three types of 
molecular operations that accomplish gene assembly) can possibly be ap- 
plied in the transformation of a given gene from its micronuclear form 
to its macronuclear form. We also characterize in which order these loop 
recombination operations are applicable. This is done in the abstract and 
more general setting of so-called legal strings. 



1 Introduction 

Ciliates are a large group of one-cellular organisms having two functionally 
different nuclei: the micronucleus and the macronucleus. An involved DNA 
transformation process called gene assembly transforms a micronucleus into a 
macronucleus. The process is accomplished using three types of DNA transfor- 
mations, which operate on special DNA sequences called pointers. These three 
types of operations are called loop recombination, hairpin recombination, and 
double-loop recombination. 

For every gene in its micronuclear form, there can be several sequences of oper- 
ations to transform this gene to its macronuclear form. We call such a sequence 
a strategy. For a given micronuclear gene strategies may differ in the number of 
hairpin and double-loop recombination operations. It has been shown that the 
number of loop recombination operations is independent of the chosen strategy 
0J|3], and that this number can be efficiently calculated pp. 

In this paper we further investigate the loop recombination operation, called the 
string negative rule in the string pointer reduction system, a formal model of 
gene assembly introduced in 0. We characterize for a given set of pointers D, 
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whether or not there is a strategy that applies loop recombination operations on 
exactly these pointers. We show that this characterization implies an efficient 
algorithm that determines this for given D. Also, we characterize the order in 
which the pointers of D can possibly be applied in strategies. These results are 
obtained using the reduction graph, a graph similar to the breakpoint graph in 
the theory of sorting by reversal, introduced in pp. 

This paper is organized as follows. In Section [2] we recall basic notions and 
terminology concerning mainly strings and graphs, and in Section we recall a 
formal model of the gene assembly process: the string pointer reduction system. 
In Section 2| we recall the notion of reduction graph and some theorems related 
to this notion. In Section we define the pointer-component graph, a graph 
that depends on the reduction graph, and we discuss a natural operation on 
this graph. In Section El we show that spanning trees of the pointer-component 
graphs reveal interesting properties concerning the string negative rule. Section^ 
shows that merging and splitting of vertices in pointer-component graphs relate 
to the removal of pointers. Using the results of Sections and [7| we characterize 
in Section[S]for a given set of pointers D, whether or not there is a strategy that 
applies string negative rules on exactly these pointers. Section strengthens 
results of Section |H1 by also characterizing in which order the string negative 
rules can be applied on the pointers. We conclude this paper with Section ITUl 

2 Basic Notions and Notation 

In this section we recall some basic notions concerning functions, strings, and 
graphs. We do this mainly to fix the basic notation and terminology. 

The composition of functions / : X — > Y and g : Y — > Z is the function 
gf : X — * Z such that (gf)(x) = g(f(x)) for every x 6 X. The restriction of / 
to a subset A of X is denoted by f\A, and for D C X we denote by f[D] the 
set {f(x) \ xeD}. 

We will use A to denote the empty string. For strings u and v, we say that v is a 
substring of u if u — W\VW2 1 for some strings W\, u>2; we also say that v occurs 
in u. 

For alphabets E and A, a homomorphism is a function ip : E* — > A* such that 
<p{xy) = ip(x)tp(y) and for all x,y G E*. Let ip : E* — > A* be a homomorphism. 
If there is a V C E such that 



then tp is denoted by eraser- 

We now turn to graphs. A (undirected) graph is a tuple G — (V,E), where V is 
a finite set and E C {{x,y} \ x,y G V}. The elements of V are called vertices 
and the elements of E are called edges. We allow x = y, and therefore edges can 
be of the form {x, x} = {x} — an edge of this form should be seen as an edge 
connecting x to x, i.e., a 'loop' for a vertex. 

Isomorphisms between graphs are defined in the usual way. Two graphs G — 
(V,E) and G' — (V',E') are isomorphic, denoted by G w G", if there is a 
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bijection a : V — > V such that {a;, J/} € E iff {a(a;), G I?', for all ar, y G V. 

A wa/fc in a graph G is a string 7r = eie2 • • • e n over E with rt > 1 such that there 
are vertices Xi,X2, ■ ■ ■ , x n +i where = {2^,2^+1} and e^+i = {xj+i, 0^+2} f° r 
1 < i < n (allowing {x,x} = {x} for all vertices x). We then also say that 7r is 
a walk from x\ to x n +\ or a walk between x\ and x n +i. We say that walk tt is 
simple if xi 5^ CEj for l<i<j<n+l.A walk from v to i> for some v G V is 
called a ct/de. Note that a loop is a cycle. We say that G is acyclic if there are 
no cycles in G. We say that G is connected if for every two vertices v\ and «2 
of G with v\ ^ U2, there is a walk from to V2- We say that G is a iree, if it 
is a connected acyclic graph. If we fix a certain vertex of the tree as the root, 
then the usual terminology of trees is used, such as the father of a vertex, and 
a child of a vertex, etc. Vertex x is isolated in G if there is no edge e of G with 
x G e. The restriction of G to E' C _E, denoted by G|s', is (V, E'). 

Graph G' = (V, E') is an induced subgraph of G if V C V and £' = £n{{ir, y} | 
G V'}. We also say that G' is the subgraph of G induced by V . A subgraph 
H of G induced by Vjj C V" is a connected component of G if H is connected, 
and for every edge e G B either e C or e C 

A (undirected) multigraph is a (undirected) graph G = (V,i?, e), where parallel 
edges are possible. Therefore, E is a finite set of edges and e : £ — > {{x, y} | 
x,y E V} is the endpoint mapping. Clearly, if e is injective, then such a multi- 
graph is equivalent to a (undirected) graph. We let Ti denote the set of undi- 
rected multigraphs. 

A 2-edge coloured graph is a (undirected) graph G = (V, E%, E2, /, s, t) where 
E\ and E2 are two finite (not necessarily disjoint) sets of edges, s, t G V are two 
distinct vertices called the source vertex and the target vertex respectively, and 
there is a vertex labelling function / : V\{s, t} — > T. The elements of T are the 
vertex labels. We use T2 to denote the set of all 2-edge coloured graphs. 

Notions such as isomorphisms, walks, connectedness, and trees carry over to 
these two types of graphs. For example, for undirected multigraph G = (V, E, e) 
and E' C E, we have G\e< = (V, E', e\E'). Care must be taken for isomorphisms. 
Two 2-edge coloured graphs G = (V, Ei,E 2 J, s, t) and G' = (V, E[,E' 2 ,f, s', if) 
are isomorphic, denoted by G ks G', if there is a bijection a : V — » V 
such that a(s) — s', a(t) = t', f(v) — f'(a(v)) for all v G V, and {x,y} G 
Ei iff {a(x), a(y)} G for all x, y G V, and i G {1, 2}. 

For 2-edge coloured graphs G, we say that a walk 7r = eie 2 • • ■ e„ in G is an 
alternating walk in G if, for 1 < i < n, both ei G £1 and e^+i G E2 or the other 
way around. 

3 String Pointer Reduction System 

Three (almost) equivalent formal models for gene assembly were considered in 
0I21E]- In this section we briefly recall one of them: the string pointer reduction 
system. This is done mainly to fix the notation and terminology associated with 
this model. For a detailed motivation and other results concerning this model 
we refer to [3]. We continue to use the string pointer reduction system in the 
remainder of this paper. 
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We fix k > 2, and define the alphabet A = {2, 3, ... , k}. For D C A, we define 
D = {a | a G D} and II = A U A. The elements of II will be called pointers. 
We use the 'bar operator' to move from A to A and back from A to A. Hence, 
for p G II, p — p. For a string u — x\X2 ■ • • x n with Xi G II, the inverse of u is 

{p if p £ A . 
- , i.e., p is 
p if p G A 

the 'unbarred' variant of p. The domain of a string « G II* is dom(v) = {p | 
p occurs in u}. A legal string is a string u G II* such that for each p G II that 
occurs in u, u contains exactly two occurrences from {p,p}. 

Definition 1 

Let u = x\Xi ■ ■ ■ x n be a legal string with a;, G II for 1 < i < n. For a pointer 
pell such that {xi,Xj} C and 1 < i < j < n, the p-interval of u is 

the substring XiXi+i ■ ■ ■ Xj. Two distinct pointers p, q G II overlap in u if both 
q G dom(Ip) and p G dom(I q ), where I p (J g , resp.) is the p-interval (g-interval, 
resp.) of u. m 

Example 

String u = 437743 is a legal string. However, v = 424 is not a legal string. Also, 
dom(u) = {3, 4, 7} and u = 347734. The 3-interval of u is 37743, and pointers 3 
and 4 overlap in u. 

The string pointer reduction system consists of three types of reduction rules 
operating on legal strings. For all p, q G II with p ^ q: 

• the string negative rule for p is defined by snr p (uippu2) = u\Ui, 



the string positive rule for p is defined by spr (U1PU2PU3) — U1U2U3, 



• the string double rule for p, q is defined by sdrp^(uipu2qu^pu4 : qu5) = 
U1U4U3U2U5, 

where u%, 112, . . . , U5 are arbitrary (possibly empty) strings over II. We also define 
Snr = {snr p | p G II}, Spr = {spr p p G 11} and Sdr = {sdr p , g | p, q G II, p ^ 
q} to be the sets containing all the reduction rules of a specific type. For a 
pointer p and a legal string u, if both p and p occur in u then we say that both 
p and p are positive in it; if on the other hand only p or only p occurs in u, then 
both p and p are negative in u. 

Note that each of these rules is defined only on legal strings that satisfy the 
given form. For example, spr^ is defined on legal string 2323, however spr 2 is 
not defined on this legal string. Also note that for every non-empty legal string 
there is at least one reduction rule applicable. Indeed, every non-empty legal 
string for which no string positive rule and no string double rule is applicable 
must have only non-overlapping negative pointers, thus there is a string negative 
rule which is applicable. This is formalized in Theorem 
Definition 2 

The domain of a reduction rule p, denoted by dom(p), equals the set of un- 
barred variants of the pointers that the rule is applied to, i.e., dom(snr p ) = 
dom(spr p ) = {p} and dom(sdr p ^ q ) = {p,q} for p,q G II. For a composi- 
tion tp = cp n ■ ■ ■ if2 ipi of reduction rules tpi, if2, ■ ■ ■ , the domain, denoted 
by dom(<p), is the union of the domains of its constituents, i.e., dom(ip) = 
dom((pi) U dom(ip2) U ■ • ■ U dom(ip n ). ■ 
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Example 

The domain of tp = snr-2 spr^ sdr75 snrg is dom(tp) = {2, 4, 5, 7, 9}. 
Definition 3 

Let S C {Snr, Spr, Sdr}. Then a composition tp of reduction rules from S is 
called an (S- Jreduction. Let u be a legal string. We say that tp is a reduction of u, 
if is a reduction and ^? is applicable to (defined on) it. A successful reduction tp 
of u is a reduction of m such that </?(u) = A. We then also say that tp is successful 
for u. We say that u is successful in S if there is a successful iS-reduction of u.u 

Note that if tp is a reduction of u, then dom{p) = dom(u)\dom(tp(u)). 
Example 

Again let u = 437743. Then tp\ = sdrj 3 spr 7 is a successful {Spr, Sdr}- 
reduction of u. However, both <p% = snr3 spr 7 and tp3 = snr§ are not reductions 
of u. 

Since for every (non-empty) legal string there is an applicable reduction rule, 
by iterating this argument, we have the following well known result. 

Theorem 4 

For every legal string u there is a successful reduction of u. 

4 Reduction Graph 

In this section we recall the definition of reduction graph and some results 
concerning this graph. First we give the definition of pointer removal operations 
on strings, see also pQ. 

Definition 5 

For a subset DC A, the Z?-removal operation, denoted by renin, is defined by 
rerun = erase Du jj. We also refer to remo operations, for all D C A, as pointer 
removal operations. u 

Example 

Let u = 543725627346 be a legal string. Then for D = {4,6,7,9}, we have 
remoiu) = 532523. In the remaining examples we will keep using this legal 
string u. 

Below we restate a lemma from The correctness of this lemma is easy to 
verify. 

Lemma 6 

Let u be a legal string and DC A. Let tp be a composition of reduction rules. 

1. If tp is applicable to remoiu) and tp does not contain string negative rules, 
then tp is applicable to u. 

2. If tp is applicable to u and dom(tp) C dom(u)\D, then p is applicable to 
rem£>(u). 

3. If tp is applicable to both u and rerriD (u) , then tp{rern£>(u)) = remu(p(u)). 
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U ^ ipxiu) 



remu(u) 




Figure 1: An illustration of Lemma HO 

Figure ^ illustrates Lemma El for the case where there is a successful reduction 
f = f2 fi of u, where ip± is a {Spr, Sdr}- reduction and ip2 is a {iSnrj-reduction 
with dom(if2) = D. 

We now restate the definition of reduction graph (see 0) in a less general form. 
We refer to yQ for a motivation and for more examples and results concerning 
this graph. The notion is similar to the breakpoint graph (or reality-and-desire 
diagram) known from another branch of DNA processing theory called sorting 
by reversal, see e.g. [8] and [7]. 

Definition 7 

Let u — P1P2 ■ ■ ■ Pn with p\, . . . , p n G II be a legal string. The reduction graph 
of u, denoted by 1Z U , is a 2-edge coloured graph (V, Ei, E%, f, s, t), where 

v = {i u i 2 ,...,i n } u {/(,4,. ..,i;> u { s ,t}, 

Ei = {e ,ei, . . . ,e„} with e { = for 1 < i < n,e = {s,7i},e„ = {I' n ,t}, 

^2= {{I'i, Ijh ih, I-} I i,3 G {1, 2, . . . , n} with i + 3 and Pi = Pj } U 

I h3 e {1,2, ...,n} &nd Pl =pj}, and 

f(Ii) = f{I[) = P l for 1 < i < n. 

m 

The edges of E\ are called the reality edges, and the edges of E2 are called the 
desire edges. Notice that for each p £ dom(u), the reduction graph of u has 
exactly two desire edges containing vertices labelled by p. 

In depictions of reduction graphs, we will represent the vertices (except for s 
and t) by their labels, because the exact identity of the vertices is not essential 
for the problems considered in this paper. We will also depict reality edges as 
'double edges' to distinguish them from the desire edges. 

Example 

We continue the example. Reduction graph 1Z U is given in Figure[2 and 7Z rem{2 7} 
is given in Figure [21 

Each reduction graph has a connected component with a linear structure con- 
taining both the source and the target vertex. This connected component is 
called the linear component of the reduction graph. The other connected com- 
ponents are called cyclic components because of their structure. 

The definition of reduction functions and the remaining results are also taken 
from pp. The p-reduction function removes vertices labelled by p and 'contracts' 
alternating walks via these vertices into a single edge. 
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s^=5 5^=6 6^=t 3 3 



5^=2 2^=6 3^=4 7 7 

5^=4 4^=6 3^=4 7 7 

2 2 

Figure 2: The reduction graph of u from the Example. 
s^=5 5^=6 6^=i 

4^=5 5^=3 3^=4 

4^=6 6^=3 3^=4 

Figure 3: The reduction graph TZ rem{2 7} („) from the Example. 
Definition 8 

For each vertex label p, we define the p-reduction function rf p : Y 2 — > T 2 , for 
G=(V,£i,£k,/,«,t)eT 2 ,ty 

tfp(G) - (V, (£7l\S r em) U E otH , £ 2 \£ rem , S, t), 

with 

V = {s,t}u{veV\{s,t}\f(v)^p}, 
Erem = {e G £a U -E 2 | f (x) = p for some x G e}, and 
E a dd — {{j/i: 2/2} | eie 2 • • • e n with n > 2 is an alternating walk in G 

with G ei, y 2 G e„, /(j/i) ^p^ /(j/2), and 

/(a:) = p for all x G e,, 1 < i < n}. 

m 

Reduction functions commute under composition. Thus, for a reduction graph 
Tlremo(u) an d pointers p and q, we have 

(rf g rf p )(K u ) = (rf p rf q )(U u ). 

Any reduction can be simulated, on the level of reduction graphs, by a sequence 
of reduction functions with the same domain. 
Theorem 9 

Let u be a legal string, and let ip be a reduction of u. Then 
(rf P n ■ ■ ■ r f P 2 rf Pl ){K u ) w Tl v(u) , 
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s = p p = q q = p p = q q = t 



Figure 4: The reduction graph of pqpq (and pqpq). 
where dom(ip) = {pi,p 2 , ■ ■ ■ ,p n }- 

The next lemma is an easy consequence from results in pQ. 
Lemma 10 

Let u be a legal string and let p S II. Then TZ U has a cyclic component C 
consisting of only vertices labelled by p iff either pp or pp is a substring of u. 
Moreover, if C exists, then it has exactly two vertices. 

One of the motivations for the reduction graph is the easy determination of the 
number of string negative rules needed in each successful reduction pQ. 

Theorem 11 

Let N be the number of cyclic components in the reduction graph of legal string 
u. Then every successful reduction of u has exactly N string negative rules. 

Example 

We continue the example. Since 1Z U has three cyclic components, by Theorem llll 
every successful reduction ip of u has exactly three string negative rules. For ex- 
ample <p — sm-6 snr4 snr2 spry sdrss is a successful reduction of u. Indeed, ip 
has exactly three string negative rules. Alternatively, snr@ sm"4 snr3 spr 2 spr 5 spr 7 
is also a successful reduction of u, with a different number of (spr and sdr) op- 
erations. 

The previous theorem and example should clarify that the reduction graph 
reveals crucial properties concerning the string negative rule. We now further 
investigate the string negative rule, and show that many more properties of this 
rule can be revealed using the reduction graph. 

However, the reduction graph does not seem to be well suited to prove properties 
of the string positive rule and string double rule. If we for example consider legal 
strings u — pqpq and v = pqpq for some distinct p, q £ II, then u has a unique 
successful reduction t/?i = spr^ spr p and v has a unique successful reduction 
if 2 = sdr Ptq . Thus u must necessarily be reduced by string positive rules, while 
v must necessarily be reduced by a string double rule. However, the reduction 
graph of u and the reduction graph of v are isomorphic, as shown in Figure 0] 
Also, whether or not pointers overlap is not preserved by reduction graphs. For 
example, the reduction graphs of legal strings pqpfqr and pqrpqr for distinct 
pointers p, q and r are isomorphic, however p and r do not overlap in the first 
legal string, but they do overlap in the latter legal string. 

The next lemma is an easy consequence of Lemma El and Theorem 1151 

Lemma 12 

Let u be a legal string, and let D C dom(u). There is a {Spr, S'drj-reduction ip 
of u with dom(tp(u)) = D iff 7t TemD (u) does not contain cyclic components. 
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Figure 5: The graph VC U from the Example. 



Proof 

There is a {Spr, Sdrj-rcduction tp of u with dom{tp{u)) — D iff there is a 
successful {Spr, Sdr}- reduction of remniu) (by Lemma |H} iff TZ remD ( u ) does 
not contain cyclic components (by Theorem If lj l. ■ 

We now consider the case where \D\is the number of cyclic components of 1Z U . 
Lemma 13 

Let u be a legal string, and let D C dom(u). There is a successful reduction 
tp = if2 <pi of u, where tp\ is a {Spr, 5<ir}-reduction and ip2 is a {Snr}- reduction 
with dom((p2) = D iff 1Z remD r u ^ and 7?. M have and |D| cyclic components, 
respectively. 

Proof 

We first prove the forward implication. By Lcmma lT^l TZ r emnM does not contain 
cyclic components. By Theorem If f I Tt re . mD {u) has \D\ cyclic components. 

We now prove the reverse implication. By Lemma |f 21 there is a successful re- 
duction tp — tp2 tp\ of u, where tpi is a {Spr, S'drj-reduction and dom{tp2) = D. 
Since 1Z U has \D\ cyclic components, by Theorem lf f I every pointer in D is used 
in a string negative rule, and thus tp2 is a {Snrj-reduction. ■ 

5 Pointer- Component Graphs 

If it is clear from the context which legal string u is meant, we will denote by £ 
the set of connected components of the reduction graph of u. We now define a 
graph on Q that we will use throughout the rest of this paper. The graph repre- 
sents how the labels of a reduction graph are distributed among the connected 
components. This graph is particularly useful in determining which sets D of 
pointers correspond to strategies that apply loop recombination operations on 
exactly the pointers of D. 

Definition 14 

Let u be a legal string. The pointer- component graph of u (or of 1Z U ), denoted 
by VC U , is an undirected multigraph (£, E, e), where E = dom(u) and e is, for 
e E E, defined by e(e) = {C £ ( \ C contains vertices labelled by e}. ■ 

Note that for each e € dom(u), there are exactly two desire edges connecting 
vertices labelled by e, thus I < |e(e)| < 2, and therefore e is well defined. 



Example 

We continue the example. Consider 1Z U shown in FigureEl Let us define C\ to be 
the cyclic component with a vertex labelled by 7, C2 to be the cyclic component 
with a vertex labelled by 5, C3 to be the third cyclic component, and R to be 
the linear component. Then £ = {C\, C2, C3, R}. The pointer-component graph 
VC U — (£, dom(u), e) of u is given in Figure 

We can use the definition of pointer-component graph to reformulate Theo- 
rem ^2 
Theorem 15 

Every successful reduction of a legal string u has exactly o(VC u ) — 1 string 
negative rules. 

For reduction ip of a legal string u, the difference between 1Z U and 1Z V ( U ) is 
formulated in Theorem [5] in terms of reduction functions. We now reformulate 
this result for pointer-component graphs. The difference (up to isomorphism) 
between the pointer-component graph PC\ of 1Z U and the pointer-component 
graph PC2 of rf p (lZ u ) (assuming rf p is applicable to 1Z U ) is as follows: in PC2 
edge p is removed and also those vertices v that become isolated, except when v 
is the linear component (since the linear component always contains the source 
and target vertex) . Since the only legal string u for which the linear component 
in VC U is isolated is the empty string, in this case we obtain a graph containing 
only one vertex. This is formalized as follows. By abuse of notation we will also 
denote these functions as reduction functions rf p . 

Definition 16 

For each edge p, we define the p-reduction function rf p : Ti — ► Ti, for G — 
(V,E,e)&T 1 ,by 

rf p (G) = (V',E',e\E'), 

where E' = E\{p} and V — {v £ V | v £ e(e) for some e S E'} if E' ^ 0, and 
V = {0} otherwise. ■ 

Therefore, these reduction functions correctly simulate (up to isomorphism) the 
effect of applications of a reduction functions on the underlying reduction graph 
when the reduction functions correspond to an actual reduction. Note how- 
ever, when these reduction functions do not correspond to an actual reduction, 
the linear component may become isolated while there are still other pointers 
present. Thus in general the reduction functions for pointer-component graphs 
do not faithfully simulate the reduction functions for reduction graphs. As a 
consequence of Theorem [5] we obtain now the following result. 

Theorem 17 

Let u be a legal string, and let ip be a reduction of u. Then 

( r f Pn ■■■ r f P 2 rf Pl )(VC u ) w {VC v(u) ), 
where dom(ip) = {pi,p 2 , ■ ■ • ,Pn}- 

Figure illustrates Theorems [5] and El 
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Wp„ ■■■ rf P1 ) 



n 



(rfpn ■■■ nfpl) 



■ PC 



ip(u) 



Figure 6: An illustration of Theorems 151 and IT7I 




Figure 7: Pointer-component graph PC\ from the Example. 
Example 

We continue the example. We have (snr4 sdr5,3)(u) = 627726. The pointer- 
component graph PC\ of this legal string is shown in Figured It is easy to see 
the graph obtained by applying (r/5 r/4 r/3) to VC U (Figure EJ) is isomorphic to 
PC X . 



6 Spanning Trees in Pointer- Component Graphs 

In this section we consider spanning trees in pointer-component graphs, and 
we show that there is an intimate connection between these trees and appli- 
cable strategies of string negative rules. The snr rules in a reduction can be 
'postponed' without affecting the applicability. Thus we can separate each re- 
duction into a sequence without snr rules, and a tail of snr rules. We often use 
this 'normal form' for notational convenience. First, we characterize {Spr, Sdr}- 
reductions in terms of pointer-component graphs. 

Theorem 18 

Let (p be a reduction of legal string u, and let D — dom(ip(u)). Then tp is a 
{Spr, SV^r {-reduction of u iff VC^^ ~ VC u \d- 

Proof 

We first prove the forward implication. By Theorem 01 there is a successful 
reduction ip' oi(p(u). By TheoremlTKl (<p' ip) has o(VC u ) — l string negative rules. 
Since <p is a {Spr, <!)dr {-reduction of it, ip' also has o(PC u ) — 1 string negative 
rules. Since ip' is a successful reduction of ip(u), by Theorem ED VC^^ has 
the same number of vertices as VC U . Therefore, by the definition of reduction 
function and Theorem 1 171 VC v t u -\ » VC u \d- 

We now prove the reverse implication. Let <p' be a successful reduction of p>(u). 
If VC V ( U ) w VC u \d, then VC V ( U ) has the same number of vertices as VC U . 
Then, by Theorem 1151 (<p' tp) has the same number of string negative rules as 
tp, therefore ip is a {Spr, Sdr {-reduction of u. m 
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It will be useful to separate loops from other edges in pointer-component graphs. 
Definition 19 

Let u be a legal string and let VC U = (V 7 E,e). We define snrdom(u) = {e G 
£||e(e)|=2}. 

Thus, snrdom(u) is the set of vertex labels p for which there are vertices labelled 
by p in different connected components of 1Z U . 

Example 

We continue the example. We have snrdom(u) = {2,3,4,5,6}, and dom(u) \ 
snrdom(u) — {7}. Indeed, the only loop in Figure [5] is 7, indicating that this 
pointer occurs only in one connected component of 1Z U . 

The following corollary to Theorem 1 1 71 observes that an edge in dom(ip{u)) is a 
loop in *PC v (y\ iff it is a loop in VC U . 

Corollary 20 

Let u be a legal string and if a reduction of u. Then snrdom(<p(u)) = dom{ip{u))d 
snrdom(u) = snrdom(u)\dom(tp). 

The examples so far have shown connected pointer-component graphs. We now 
prove that these graphs are always connected. 

Theorem 21 

The pointer-component graph of any legal string is connected. 
Proof 

Let (p be a successful reduction of a legal string u (<p exists by Theorem 0} . 
Assume that VC U is not connected. Since VC\ is connected, we have by Theo- 
rem El '■P = fiP i P\ for some reduction rule p, where VC Vl ^ is not connected, 
but VC pi p x i u \ is. By Theorem 1181 p cannot be a string double rule or a string 
positive rule, and therefore p = snr p for some p G LI. Since VC (pi ^ u ) is not 
connected, but TC PV1 ( U ) is, by the definition of reduction function, there is a 
connected component of VC Vl ^ u ) containing only the edge p. We consider two 
cases: p £ snrdom(u) and p 6 dom(u)\snrdom(u) . If p 6 snrdom(u), then 
^p^i(ti) would have two connected components less than TZ Vl ^ — a contradic- 
tion by Theorem 1 151 If p E dom(u)\snrdom(u) , then "PC ( p 1 r u \ has a connected 
component containing only a vertex and a loop — a contradiction by Lemma llOl 
Thus in both cases we have a contradiction, and therefore VC U is connected. ■ 

The next theorem characterizes successfulness in {Snr} using spanning trees. 
Theorem 22 

Let u be a legal string. Then u is successful in {Snr} iff VC U is a tree. 
Proof 

If u is successful in {Snr}, then, by Theorem El VC U has \C\ — 1 edges. By 
Theorem E2 it follows that VC U is a tree. 

If VC U is a tree, then VC U has \(\ — 1 edges. Since the number of edges is 
\dom{u)\, we have \dom(u)\ = \(\ — 1, and by Theorem 1 1 51 every p G dom(u) is 
used in a string negative rule, and thus u is successful in {Snr}. m 
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It turns out that the pointers on which string negative rules are applied in a 
successful reduction of u form a spanning tree of VC U . 

Theorem 23 

Let u be a legal string, and let D C dom(u). If there is a successful reduction 
ip = if2 fi of u, where ipi is a {Spr, Sdr}- reduction and ip% is a {Snrj-reduction 
with dom(ip2) — D, then VC u \d is a tree. 

Proof 

By Theorem E2 VC Vliu) is a tree. By Theorem IT5I VC ri /„.\ w "PC U | D - ■ 
Example 

We continue the example. We saw that ip = snrg snr4 snr2 spr=> sdrs.3 is a 
successful reduction of u. By Theorem 1231 "PCnl {2,4,6} is a tree. This is clear 
from Figure where VC U is depicted. 

In the next few sections we prove the reverse implication of the previous theorem. 
This will require considerably more effort than the forward implication. The 
reason for this is that it is not obvious that when VC u \d is a tree, there is a 
reduction <p\ of u such that D = dom((pi(u)). We will use the pointer removal 
operation to prove this. 

First, we consider a special case of the previous theorem. Since a loop can never 
be part of a tree, we have the following corollary to Theorem 1231 

Corollary 24 

Let u be a legal string and let p G dom(u). If snr p or snr^ is in a (successful) 
reduction of u, then p G snrdom(u) . 

Example 

We continue the example. Since tp = snrg sm"4 snr2 spr=> sdrs^ is a successful 
reduction of u, we have 2,4,6 G snrdom(u). 

We show in Theorem 1321 that the reverse implication of Corollary [^] also holds. 
Hence, the name snrdom(u) is explained: the pointers p € snrdom(u) are ex- 
actly the pointers for which snr p or snr^ can occur in a (successful) reduction 
of u. 



7 Merging and Splitting Components 

In this section we consider the effect of pointer removal operations on pointer- 
component graphs. It turns out that these operations correspond to the merging 
and splitting of connected components of the underlying reduction graph. First, 
we formally introduce the merging operation. 

Definition 25 

For each edge p, the p-merge rule, denoted by merge p , is a rule applicable to 
(defined on) G = (V, E, e) G Ti with p G E and \e(p)\ = 2. It is defined by 

merge p (G) = (V',E',e'), 

where E' = E\{p}, V = (V\e(p)) U {v'} with {«'} n V = 0, and e'(e) = 
{h(vi), h(v2)} iff e(e) = {vi,i>2} where h(v) = v' if v G e(p), otherwise it is the 
identity. ■ 
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Again, we allow both v\ = v 2 and h(v\) = h(i)2) in the previous definition. Intu- 
itively, the p-merge rule 'merges' the two endpoints of edge p into one vertex, and 
therefore the resulting graph has exactly one vertex less than the original graph. 
Note that p-merge rules commute under composition. Thus, if (merge q merge p ) 
is applicable to G, then 

(merge q merge p )(G) — (merge p merge q ){G). 

Theorem 26 

Let G — (V,E,e) £ T 1; and let D = {pi, ■ ■ ■ ,p n } ^ E. Then (merge Pn 
■ ■ ■ merge Pl ) is applicable to G iff G\d is acyclic. 

Proof 

(merge Pn ■ ■ ■ merge Pl ) is applicable on G iff for all Pi (1 < i < n), e{pi) % 
e[{pi, . . . ,Pi-i}} and |e(pj)| = 2 iff G\ D is acyclic. ■ 

One of the most surprising aspects of this paper is that the pointer removal 
operation is crucial in the proofs of the main results. The next theorem compares 
VC U with VC rern{p) ( u ) for a legal string u and p S dom(u). We distinguish three 
cases: either the number of vertices of VC rern{p) ( u ) is one less, is equal, or is one 
more than the number of vertices of VC U . The proof of this theorem shows that 
the first case corresponds to merging two connected components of 1Z U into one 
connected component, and the last case corresponds to splitting one connected 
component of 1Z U into two connected components. 

Theorem 27 

Let u be a legal string. 

• If p £ snrdom(u) , then VC rem[p} ^ u ) « merge p (VC u ) 
(and therefore o{VC rem{p}{u) ) = o(PC u ) - 1). 

• If p £ dom(u)\snrdom(u) , then o{VC u ) < o{VC rem{p ^ u )) < o(VC u ) + 1. 
Proof 

We first prove the p £ snrdom(u) case of the theorem. Then the two desire edges 
with vertices labelled by p belong to different connected components of 1Z U . We 
distinguish two cases: whether or not there are cyclic components consisting of 
only vertices labelled by p. 

If there is cyclic component consisting of only vertices labelled by p, then by 
Lemma ITUl pp or pp are substrings of u, and 1Z U is 



P^=P 




qi = p p = q 2 

where we omitted the parts of the graph that are the same compared to lZ r em {p} (u 
Now, TZ rem{p}{u ) is 

qi = 92 

Therefore VC rem{p} ^ u j can be obtained (up to isomorphism) from VC U by ap- 
plying the merge p operation. 
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Now assume that there are no cyclic components consisting of only vertices 
labelled by p. Then, 1Z U is 

9i = p p = 92 



q 3 = p p = 94 

where we again omitted the parts of the graph that are the same compared to 
Ti- r em {p} (u)- Now, depending on the positions of qi, . . . , 94 relative to p in u and 
on whether p is positive or negative in u, r R- r em {p] (u) is either 

<7i = q 4 

93 = 92 

or 

9i = 93 

94 = 92 

Since 91 and 92 remain part of the same connected component (the same holds 
for (73 and 94), the two connected components are merged, and thus 'PC renl{p} ^ 
can be obtained (up to isomorphism) from VC U by applying the merge p opera- 
tion. 

We now prove the p S dom(u)\snrdom(u) case. Then the two desire edges 
with vertices labelled by p belong to the same connected component of 1Z U . By 
Lemma ITU1 there are no cyclic components consisting of four vertices which are 
all labelled by p. We can distinguish two cases: whether or not there is a reality 
edge e connecting two vertices labelled by p. If there is such an reality edge e 
than 1Z U is 

91 = p p = p p = 94 

where we again omitted the parts of the graph that are the same compared 
to TZ-rem^^u)- Thus occurs precisely when pp or pp is a substring of u. Now, 

gi = q 4 

Therefore, 7^rem {p} (u) has N cyclic components. 
If there is no such a reality edge e, then 1Z U is 

9i = P P = 92 L 93 = P P = 94 

where L represents some (possibly empty) 'linear subgraph' of 1Z U , and where we 
again omitted the parts of the graph that are the same compared to 7t rem{p} ( u y 
Now, Ur em{py (u) is either 

94 = 92 L 93 = 9i 
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Figure 8: Reduction graph TZ rem {2 , („) from the Example. 

7 



3 o 5 




4 6 



Figure 9: VC rem , 2 ,^ from the Example. 

or 

L 




q 2 = q 3 qi = <? 4 

Therefore, H rem ( u ) has either A?" cyclic components (corresponding with the 
first case) or N + 1 cyclic components (corresponding with the second case). ■ 

Example 

We continue the example. By Theorem l27l we know from Figure[3that VC renl{2} (, 
merge2(VC u ). Indeed, this is transparent from Figures El & n d El where VC U , 
^•rem {2} («), and "PC re „ l{2} („) are depicted, respectively. 

Again by TheoremlTfl we know from Figure|Hlthat K rem{ („) has two or three 
cyclic components. Indeed, this is transparent from Figure|o| where lZ rerrl{2 7} ( u ) 
is depicted. 

Note that by the definition of merge p , merge p is applicable to VC U precisely 
when p s snrdom{u). Therefore, by Theorems 1261 and 1271 we have the following 
corollary. 

Corollary 28 

Let u be a legal string, and let D C dom(u). If VC u \d is acyclic, then 

T > C T em D {v.) ~ {merge Vn ■■■ merge Pl )(VC u ), 
where D = {pi, . . . ,p n }. 

8 Applicability of the String Negative Rule 

In this section we characterize for a given set of pointers D, whether or not 
there is a (successful) strategy that applies string negative rules on exactly these 



16 



C 3 — ^— d — C 2 R 



Figure 10: A subgraph of the pointer-component graph from the Example. 

Ci 

3 

C 3 — j— C 2 R 

Figure 11: A subgraph of the pointer-component graph from the Example. 

pointers. First we will prove the following result which depends heavily on the 
results of the previous section. The forward implication of the result observes 
that by removing pointers from u that form a spanning tree in VC U we obtain a 
legal string u' for which the reduction graph does not have cyclic components. 

Lemma 29 

Let u be a legal string, and let D C dom{u). Then VC u \d is a tree iff 7t r em D (u) 
and 1Z U have and \D\ cyclic components, respectively. 

Proof 

We first prove the forward implication. Let VC u \d be a tree. By Corollary 1281 
'PC r . emD ( u ) contains a single vertex. Thus lZ r em D (u) has no cyclic components. 
Since VC u \u is a tree, we have \D\ = |£| — 1. 

We now prove the reverse implication. Let 7Z rernD r u ) not contain cyclic com- 
ponents and \D\ — |£| — 1. By Theorem 1271 we see that the removal of each 
pointer p in D corresponds to a merge p operation, otherwise 7Z r em D (u) would 
contain cyclic components. Therefore, (merge Prl ■■■ merge Pl ) is applicable to 
VC U with D — {pi, . . . ,p n }. Therefore, by Theorem l26l VC u \d is acyclic. Again 
since \D\ = \C\ — 1, it is a tree. ■ 

Example 

We continue the previous example. Let D\ — {2, 3, 5} and D 2 — {2, 3, 4}. Then 
VC u \d 1 (VC u \d 2 , resp.) is given in Figure ITO1 (Figure ITT1 resp.). Notice that 
\Di\ = \D 2 \ = |C| — L Since 'PC u |_d 1 is a tree and VC u \d 2 is not a tree, by 
LemmaEHl it follows that "R- r em D {u) does not have cyclic components and that 
T^-remo M does have at least one cyclic component. This is illustrated in Fig- 
ures 02 and where H remDi ( u ) and TZ remD ( u ) are depicted respectively. 

s^=4 4^=6 6^=7 7 



t^=6 6^=7 7^=4 4 

Figure 12: The reduction graph r R- r em Dl (u) from the Example. 
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Figure 13: The reduction graph 7Z rern . D ( u ) from the Example. 

The next theorem is one of the main results of this paper. It improves The- 
orem 1231 by characterizing exactly which string negative rules can be applied 
together in a successful reduction of a given legal string. 

In this theorem we require that the string negative rules of ip are applied last. 
Recall that this is only a notational convenience since for every successful re- 
duction of a legal string, we can postpone the application string negative rules 
to obtain a successful reduction of the given form. 

Theorem 30 

Let u be a legal string, and let D C dom(u). There is a successful reduction 
if = (p 2 if i of u, where ip\ is a {Spr, 5dr}-reduction and f2 is a {Snrj-reduction 
with dom(ip2) = D iff VC u \u is a tree. 

Proof 

It directly follows from Lemma 1291 and Lemma IT^l ■ 

Since there are many well known and efficient methods for determining spanning 
trees in a graph, it is easy to determine, for a given set of pointers D, whether 
or not there is a successful reduction applying string negative rules on exactly 
the pointers of D (for a given legal string u). 

Example 

We continue the example. By Theorem 123 and Figure there is a success- 
ful reduction ip = ip2 Pi of u, for some {Spr, S'drj-reduction tpi and {Snr}- 
reduction (f2 with dom{<f>2) — {2,3,5}. Indeed, we can take for example <f = 
sm"5 snr2 snrg spry sdr^- 

By Theorem 1301 and Figure 1111 there is no successful reduction ip = if 2 <fi 
of u, where ip\ is a {Spr, 5*(ir}-reduction and (f2 is a {Snr }-reduction with 
dom((f2) — {2,3,4}. For example, (spr 5 spr 7 )(w) = 62342346 and thus there is 
no string pointer rule for pointer 6 applicable to this legal string. 

In the next corollary we consider the more general case \D\ < \(\ — 1, instead of 
\D\ = ICI - 1 in Theorem 

Corollary 31 

Let u be a legal string, and let D C dom{u). There is a (successful) reduction if 
of u such that for all p £ D, if contains either snr p or snr^ iff VC u \d is acyclic. 

Proof 

We first prove the forward implication. By Theorem 1301 VC u \d is a subgraph 
of a tree, and therefore acyclic. 
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We now prove the reverse implication. By Theorem PC U is connected, and 
since VC u \d does not contain cycles, we can add edges q € dom(u)\D from 
VC U such that the resulting graph is a tree. Then by Theorem 1301 it follows 
that there is a (successful) reduction tp of u containing either snr p or snr-p for 
all p e D. m 

The previous corollary with \D\ = 1 shows that the reverse implication of Corol- 
lary [31 also holds, since VC u \i p \ acyclic implies that the edge p connects two 
different vertices in VC U . 

Theorem 32 

Let u be a legal string and let p £ dom(u). Then snr p or snr^ is in a (successful) 
reduction of u iff p 6 snrdom(u). 

This theorem can also be proven directly. 
Proof 

To prove the reverse implication, let no reduction of u contain either snr p or 
snrp. We prove that p $ snrdom(u). By iteratively applying snr, spr and sdr 
on pointers that are not equal to p or p, we can reduce u to a legal string v such 
that for all q £ dom(v)\{p}: 

• qq and qq are not substrings of v. 

• q is negative in v. 

• q does not overlap with any pointer in dom(v)\{p}. 

If rem{ p y(v) = A, then v is equal to either pp, pp, pp or pp. If rem^ p j(v) =/= A, 
then, by the last two conditions, there is a q S II such that qq is a substring 
of rem^pj(v). Then, by the first condition, either qpq, qpq, qppq, qppq, qppq or 
qppq is a substring of v. 

Thus, either qpq, qpq, pp, pp, pp or pp is a substring of v. Since no reduction of 
u contains snr p or snr^, the last two cases are not possible. The first two cases 
correspond to the following part of 1Z V . 

P P q q P P ■■■ 

The cases where pp or pp is a substring of v correspond to the following part of 

P P P P ■■■ 

Consequently, in either case, the two desire edges of 1Z V with vertices labelled 
by p belong to the same connected component. Thus p g" snrdom(v). By Corol- 
lary |20| p ^ snrdom(u). m 

9 The Order of Loop Recombination 

In this section we strengthen Theorem |20l to allow one to determine exactly 
which orderings of string negative rules in ip2 are possible. First we introduce 
orderings in a tree, which is similar to topological orderings in a directed acyclic 
graph. Here we order the edges instead of the vertices. 
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Definition 33 

Let T = (V, E, e) be a tree. An edge-topological ordering ofT (with root R £ V ) 
is a linear order E' = (ei, e2, . . . , e n ) on B such that if e(ej) = {C x , C y }, e(ej) = 
{C y ,C z }, and C y (C z , resp.) is the father of C x (C y , resp.) in T considering R 
as the root of T, then j > i. m 

Example 

We continue the example. Consider again tree VC u \d-i shown in Figure ITU1 
Taking R as the root of VC u \d x , it follows that (3,2,5) is an edge-topological 
ordering of VC u \d x ■ 

The next theorem characterizes exactly the possible orders in which string neg- 
ative rules that can be applied in a successful reduction of a given legal string. 

Theorem 34 

Let u be a legal string, and let L = (j>i,P2, ■ ■ ■ ,Pn) be an ordered set over 
dom(u). There is a successful reduction 

•p = tPn+l snrp n tp n snr Pn _ 1 ■ ■ ■ ip 2 snr^ epi 

of u, for some (possible empty) {Spr, SVirj-reductions tpx, tp2, • ■ ■ , <p n +i an d Pi S 
{PiiPi} f° r 1 < i < n and n > iff PC u \l> is a tree with L' — {pi,p2, ■ ■ ■ ,p n } 
and L is an edge-topological ordering of VC u \l' with the linear component R of 
TZ U as root. 

Proof 

We first prove the forward implication. Clearly, we can postpone the application 
of string negative rules, thus snr^ snrp n _ 1 ■ ■ ■ snr^ <pf is also a successful 
reduction of u, where ip' = ip n +i fn • • ■ fi- By Theorem 1301 VC u \l' is a tree. 

We prove that L is an edge-topological ordering of VC u \l' with root R. By 
Theorem IT%1 VC v ir u \ ~ PC u \l'- If n > 0, then snr^ is applicable to f'{u). 
By Theorem El edge p% is connected to a leaf of VC v >( u y By Theorem H7I 
'PC( snr _ i v ')(u) is isomorphic to the graph obtained from VC V ^ U ) by removing 
Pi and its leaf. Now (assuming n > 1), since sm-p 2 is applicable to (snr^ ip')(u), 
Pi is connected to a leaf in VC( snr ~ i V ')( u ) - By iterating this argument, it follows 
that L is an edge-topological ordering of VC^,^ » VC u \l' with root R. 

We now prove the reverse implication. Since VC u \l' is a tree, by Theorem 1301 
there is a successful reduction ip — Lp 2 tfi of u, where ipi is a {Spr, Sdr}- 
reduction and <p 2 is a {Snr }-reduction with dom(tp2) — L' . Let L be an edge- 
topological ordering of VC u \l' with the linear component R of 1Z U as root. 
Again, by TheoremUl VC U \ L > « VC Vl(u) . 

If n > 0, then p\ is connected to a leaf C\ of PCi. Consequently, C\ has 
only vertices labelled by p\. By Lemma El snrpj is applicable to <pi{u) for 
some pi 6 By Theorem 1171 PC( snri5i Vl )( M ) is isomorphic to the graph 

obtained from PC Vl ( M ) by removing p\ and its leaf. By iterating this argument, 
it follows that snrp n snr^ n _ 1 • • • snrpj is a successful reduction of u for some 
Pi G {PiiPi} and 1 < i < n with n > 0. ■ 

Example 

We continue the example. Since (3,2,5) is an edge-topological ordering of tree 
'PCuIdx with root R, by Theorem 1341 there is a successful reduction ip = (p 2 tfi 
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C\ C3 
Figure 14: A subgraph of the pointer-component graph from the Example. 

of u, for some {Spr, Sdrj-reduction ipi and p>2 = snrg snrj snrg for some p G 
{p, p] for p G {2, 3, 5}. Indeed, we can take for example (p = snrs snr2 snrg spr=> 
sdr 4 , 6 . 

We say that two reduction rules p\ and P2 can be applied in parallel to u if both 
P2 pi and pi P2 are applicable to u (see [B])- 

Corollary 35 

Let u be a legal string, and p,i|C dom(u) with p =/= q. Then snrp and snr^ can 
be applied in parallel to u for some p € {p,p} 7 q G {q, g} iff there is a spanning 
tree T in "PC„ such that p and g both connect to leaves (considering the linear 
component of TZ U as the root) . 

The next corollary considers the case whether or not snr^ and snr^ can even- 
tually be applied in parallel. 

Corollary 36 

Let u be a legal string, and p, q C dom(u) with p =/= q. Then snr^ and snr^ can 
be applied in parallel to f(u) for some p £ {p,p\, q G {q, q} and reduction ip iff 
there is a spanning tree T in VC U such that there is no simple walk in T from 
the linear component of 1Z U (the root) to another vertex of T containing both 
edges p and q. 

Example 

We continue the example. Let D3 = {2,4,6}. Then in tree VC u \d 3 , depicted in 
Figure lLH there is no simple walk from R to another vertex of VC U \d 3 containing 
both edges 2 and 4. By Corollary |2U snr^ and snr^ can be applied in parallel 
to ip(u) for some 2 G {2, 2}, 4 G {4, 4} and reduction ip of u. Indeed, if we take 
ip = spr j sdra^, then snr2 and snr4 can be applied in parallel to <p(u) = 622446. 

10 Conclusion 

This paper showed that one can efficiently determine the possible sequences of 
loop recombination operations that can be applied in the transformation of a 
given gene from its micronuclear to its macronuclear form. Formally, one can 
determine which string negative rules can be applied in which order to a legal 
string u, given only the reduction graph of u. This is characterized in terms of 
graphs defined on the reduction graphs. Future research could focus on similar 
characterizations for the string positive rules and the string double rules. 
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